836 research outputs found

    Discrete Prompt Compression with Reinforcement Learning

    Full text link
    Instruction-tuned Language Models (LMs) are widely used by users to address various problems with task-specific prompts. Constraints associated with the context window length and computational costs encourage the development of compressed prompts. Existing methods rely heavily on training embeddings, which are designed to accommodate multiple token meanings. This presents challenges in terms of interpretability, a fixed number of embedding tokens, reusability across different LMs, and inapplicability when interacting with black-box APIs. This study proposes prompt compression with reinforcement learning (PCRL), a novel discrete prompt compression method that addresses these issues. PCRL employs a computationally efficient policy network that directly edits prompts. The PCRL training approach can be flexibly applied to various types of LMs, as well as decoder-only and encoder-decoder architecture, and can be trained without gradient access to LMs or labeled data. PCRL achieves an average reduction of 24.6% in token count across various instruction prompts while preserving performance. Further, we demonstrate that the learned policy can be transferred to larger LMs, and through various analyses, we aid the understanding of token importance within prompts

    Evolving speciated checkers players with crowding algorithm

    Get PDF
    Abstract-Conventiona1 evolutionary algorithms have a property that only one solution often dominates and it is sometimes useful to find diverse solutions and combine them because there might be many different solutions to one problem in real world problems. Recently, developing checkers player using evolutionary algorithms has been widely exploited to show the power of evolution for machine learning. In this paper, we propose an evolutionary checkers player that is developed by a speciation technique called crowding algorithm. In many experiments, our checkers player with ensemble structure shows better performance than non-speciated checkers players

    Bacterial community analysis in upflow multilayer anaerobic reactor (UMAR) treating high-solids organic wastes

    Get PDF
    A novel anaerobic digestion configuration, the upflow multi-layer anaerobic reactor (UMAR), was developed to treat high-solids organic wastes. The UMAR was hypothesized to form multi-layer along depth due to the upflow plug flow; use of a recirculation system and a rotating distributor and baffles aimed to assist treating high-solids influent. The chemical oxygen demand (COD) removal efficiency and methane (CH4) production rate were 89% and 2.10 L CH4/L/day, respectively, at the peak influent COD concentration (110.4 g/L) and organic loading rate (7.5 g COD/L/day). The 454 pyrosequencing results clearly indicated heterogeneous distribution of bacterial communities at different vertical locations (upper, middle, and bottom) of the UMAR. Firmicutes was the dominant (>70%) phylum at the middle and bottom parts, while Deltaproteobacteria and Chloroflexi were only found in the upper part. Potential functions of the bacteria were discussed to speculate on their roles in the anaerobic performance of the UMAR system

    The interplay between real and pseudo magnetic field in graphene with strain

    Full text link
    We investigate electric and magnetic properties of graphene with rotationally symmetric strain. The strain generates large pseudo magnetic field with alternating sign in space, which forms a strongly confined quantum dot connected to six chiral channels. The orbital magnetism, degeneracy, and channel opening can be understood from the interplay between real and pseudo magnetic field which have different parities under time reversal and mirror reflection. While the orbital magnetic response of the confined state is diamagnetic, it can be paramagnetic if there is an accidental degeneracy with opposite mirror reflection parity

    Recent Advances in General Game Playing

    Get PDF
    The goal of General Game Playing (GGP) has been to develop computer programs that can perform well across various game types. It is natural for human game players to transfer knowledge from games they already know how to play to other similar games. GGP research attempts to design systems that work well across different game types, including unknown new games. In this review, we present a survey of recent advances (2011 to 2014) in GGP for both traditional games and video games. It is notable that research on GGP has been expanding into modern video games. Monte-Carlo Tree Search and its enhancements have been the most influential techniques in GGP for both research domains. Additionally, international competitions have become important events that promote and increase GGP research. Recently, a video GGP competition was launched. In this survey, we review recent progress in the most challenging research areas of Artificial Intelligence (AI) related to universal game playing

    Charge Transfer Induced Molecular Hole Doping into Thin Film of Metal-Organic-Frameworks

    Full text link
    Despite the highly porous nature with significantly large surface area, metal organic frameworks (MOFs) can be hardly used in electronic, and optoelectronic devices due to their extremely poor electrical conductivity. Therefore, the study of MOF thin films that require electron transport or conductivity in combination with the everlasting porosity is highly desirable. In the present work, thin films of Co3(NDC)3DMF4 MOFs with improved electronic conductivity are synthesized using layer-by-layer and doctor blade coating techniques followed by iodine doping. The as-prepared and doped films are characterized using FE-SEM, EDX, UV/Visible spectroscopy, XPS, current-voltage measurement, photoluminescence spectroscopy, cyclic voltammetry, and incident photon to current efficiency measurements. In addition, the electronic and semiconductor property of the MOF films are characterized using Hall Effect measurement, which reveals that in contrast to the insulator behavior of the as-prepared MOFs, the iodine doped MOFs behave as a p-type semiconductor. This is caused by charge transfer induced hole doping into the frameworks. The observed charge transfer induced hole doping phenomenon is also confirmed by calculating the densities of states of the as-prepared and iodine doped MOFs based on density functional theory. Photoluminescence spectroscopy demonstrate an efficient interfacial charge transfer between TiO2 and iodine doped MOFs, which can be applied to harvest solar radiations.Comment: Main paper (19 pages, 6 figures) and supplementary information (15 pages, 10 figures), accepted in ACS Appl. Materials & Interface
    corecore